Group-Based Atrous Convolution Stereo Matching Network

نویسندگان

چکیده

Stereo matching is the key technology in stereo vision. Given a pair of rectified images, determines correspondences between images and estimate depth by obtaining disparity corresponding pixels. The current work has shown that estimation from can be formulated as supervised learning task with an end-to-end frame based on convolutional neural networks (CNNs). However, 3D CNN puts great burden memory storage computation, which further leads to significantly increased computation time. To alleviate this issue, atrous convolution was proposed reduce number operations via relatively sparse receptive field. field makes it difficult find reliable points fuzzy areas, e.g., occluded areas untextured owing loss rich contextual information. address problem, we propose Group-based Atrous Convolution Spatial Pyramid Pooling (GASPP) robustly segment objects at multiple scales affordable computing resources. main feature GASPP module set layers continuous dilation rate each group, so impact holes introduced network performance. Moreover, introduce tailored cascade cost volume pyramid form memory, meet real-time group-based evaluated street scene benchmark KITTI 2015 Scene Flow achieves state-of-the-art

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rethinking Atrous Convolution for Semantic Image Segmentation

In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter’s field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in paralle...

متن کامل

Pyramid Stereo Matching Network

Recent work has shown that depth estimation from a stereo pair of images can be formulated as a supervised learning task to be resolved with convolutional neural networks (CNNs). However, current architectures rely on patch-based Siamese networks, lacking the means to exploit context information for finding correspondence in illposed regions. To tackle this problem, we propose PSMNet, a pyramid...

متن کامل

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view, while the latter networks can capture sharper object boundaries by gradual...

متن کامل

MCGE: multi-candidate based group evolution in stereo matching

This paper addresses the subject of stereo matching between the corners of two perspectives. Similarity-based matching is prone to errors, and the existing algorithms reject outliers but never correct them. Frequently, in fact, the correct correspondence is not found at the single point with the largest similarity; but it lies among a few points with large value. So we propose a new algorithm, ...

متن کامل

o - SEGMENT - BASED STEREO MATCHING

Images are :wo dimensional projections of three dimensional scenes, therefore depth recovery is a crucial problem in Image Understanding, with applications in passive navigation, cartography, surveillance, and industrial robotics. Stereo analysis provides a more direct quantitative depth evaluation than techniques such as shape from shading, and its being passive makes it more applicable than a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Wireless Communications and Mobile Computing

سال: 2021

ISSN: ['1530-8669', '1530-8677']

DOI: https://doi.org/10.1155/2021/7386280